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acpS 



FIELD OF THE INVENTION 

This invention relates to newly identified polynucleotides and polypeptides, and their production and 
uses, as well as their variants, agonists and antagonists, and their uses. In particular, the invention relates to 
polynucleotides and polypeptides of the acyl carrier protein synthase family, as weU as their variants, herein 
referred to as "acpS," "acpS polynucleotide(s),'* and "acpS polypeptide(s)" as the case may be. 

BACKGROUND OF THE INVENTION 

The Streptococci make up a medically in:q)oitant genera of microbes known to cause several types of 
disease in humans, including, for example, otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, 
sinusitis, pleural empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid. Since its isolation more than 100 years ago, Streptococcus pneumoniae has been one of the 
more intensively studied microbes. For exanqDie, much of our early understanding that DNA is, n fact, the 
genetic material was predicated on the work of GriflBth and of Avery, Macleod and McCarty using this 
microbe. Despite the vast amount of research with S. pneumoniae, many questions concerning the virulence of 
this microbe remain. It is particularly preferred to wtploy Strqjtococcal genes and gene products as targets for 
the development of antibiotics. 

The fi^equency of Streptococcus pneumoniae infections has risen dramaticaUy in the past few decades. 
This has been attributed to the emergence of multiply antibiotic resistant strains and an increasing population of 
people with weakened immune systems. It is no longer uncommon to isolate Streptococcus pneumoniae strains 
that are resistant to some or all of the standard antibiotics. This phenomenon has created an unmet medical 
need and demand for new anti-microbial agents, vaccines, dnig screening methods, and diagnostic tests for this 
organism. 

Moreover, the drug discovery process is currently undergoing a fundamental revolution as it embraces 
"functional genomics," that is, high throughput genome- or gene-based biology. This approach is rapidly 
superseding earher approaches based on "positional cloning" and other methods. Functional genomics relies 
heavily on the various tools of bioinformatics to identify gene sequences of potential interest from the many 
molecular biology databases now available as weU as from other sources. There is a continuing and significant 
need to identify and characterize fiirther goies and odier polynucleotides sequences and their related 
pohpeptides, as targets for drug discovery. 
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Clearly, there exists a need for polynucleotides and polypeptides, such as the acpS embodiments of the 
invention, that have a present benefit of, among other things, being useful to screen compounds for 
antimicrobial activit}'. Such factors are also useful to determine their role in pathogenesis of infection, 
dysfunction and disease. There is also a need for identification and characterization of such factors and their 
antagonists and agonists to find ways to prevent, ameliorate or correct such infectioa dysfunction and disease. 

SUMMARY OF THE INVENTION 

The present invention relates to acpS, in particular acpS polypeptides and acpS polynucleotides, 
recombinant materials and methods for their production. In another aspect, the invention relates to methods for 
using such pol>peptides and polynucleotides, including treatment of microbial diseases, amongst others. In a 
further aspect, the invention relates to methods for identifying agonists and antagonists using the 
materials provided by the invention, and for treating microbial infections and conditions associated with 
such infections with the id^itified agonist or antagonist compounds. In a still further aspect, the invention 
relates to diagnostic assays for detecting diseases associated with microbial infections and conditions 
associated with such infections, such as assays for detecting acpS expression or activity. 

Various changes and modifications within the spirit and scope of the disclosed invention wiD become 
readily apparent to :uose skilled in the ait fi-om reading the following descriptions and fi-om reading the other 
parts of the present disclosure. 

DESCRIPTION OF THE INVENTION 

The invaition relates to acpS polypeptides and polynucleotides as described in greater detail below. In 
particular, the invention relates to polypeptides and polynucleotides of a acpS of Streptococcus pneumoniae, 
which is related by amino acid sequence homology to H.pylori acpS polypeptide. Hie invention relates 
especially to acpS having a nucleotide and amino acid sequences set out in Table 1 as SEQ ID NO:] and SEQ 
ID N0:2 respectively. Note that sequences recited in the Sequence Listing below as "DNA" represent an 
exemplification of the invention, since those of ordinary skill will recognize that such sequences can be 
usefiiUy employed in polynucleotides in general, including ribopolynucleotides. 



TABLE 1 

acpS Polynucleotide and Polypeptide Sequences 

(A) Streptococcus pneumoniae acpS polynucleotide sequence [SEQ ID N0:1]. 

5'- 

ATGATAGTTGGACACGGAATTGACATCGAAGAATTGGCTTCGATAGAAAGCGCAGTTACACGACATGAAGGA^ 
CTAAGCGTGTACTGACCGCTCAGGAAATGGAGCGCTTCACCAGTCTCAAAGGACGCAGGCAAATAGAATATTTAGC 
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TGGTCGCTGGTCGGCTAAGGAGGCCTTTTCCAAGGCTATGGGAACGGGCATTAGCAAGCTCGGTTTTCAGGATTTG 
GAAGTCTTGAAC 

AATGAACGTGGGGCGCCTTATTTTAGTCAGGCACCATTTTCPlGGAAAGATTTGGCTGTCTATCAGCCACACCGATC 
AGTTTGTGACAGCCAGTGTCATTTTGGAGGAAAATCATGAAAGCTAG-3 ' 

(B) Streptococcus pneumoniae aq)S polypeptide sequence deduced from a polynucleotide sequence 
in this table [SEQ ID N0:2]. 

NH2- 

MIVGHGIDIEEIASIESAVTRHEGFAKRVXTAQEMERFTSLKGRRQIEYLAGRWSAKEAFSKAMGTGISKLGFQDL 
EVLNNERGAP YFSQAP FSGKI WLS I SHTDQFVTASVILEENHES * -COOH 



Deposited materials 

A deposit comprising a Streptococcus pneumoniae 0100993 strain has been deposited with the 
National Collections of Industrial and Marine Bacteria Ltd. (herein ''NCIMB"), 23 St. Machar Drive, Aberdeen 
AB2 IRY, Scotland on 1 1 April 1996 and assigned deposit number 40794, The deposit was described as 
Streptococcus pneumoniae 0100993 on deposit. On 17 April 1996 a Streptococcus pneumoniae 0100993 
DNA library in £. coli was similarly dq>osited with the NCIMB and assigned deposit number 40800. The 
Streptococcus pneumoniae strain deposit is referred to herein as "the deposited r^' rain" or as "the DNA of the 
deposited strain." 

The deposited strain comprises a full length acpS gene. The sequence of the polynucleotides comprised 
in the deposited strain, as well as the amino add sequence of any polypeptide encoded thereby, are controlling in 
the event of any conflict with any description of sequences herein. 

The deposit of the deposited strain has been made under the terms of the Budapest Treaty on the 
International Recognition of the Deposit of Micro-oiganisms for Purposes of Patent Procedure. The deposited 
strain will be irrevocabh' and without restriction or condition released to the public upon the issuance of a 
patent. The deposited strain is provided merely as convenience to those of skill in the art and is not an 
admission that a deposit is required for aiablement, such as that required under 35 U.S.C. § 1 12. A license may 
be required to make, use or sell the deposited strain, and compounds derived therefrom, and no such license is 
hereby granted. 

In one aspect of the invention there is provided an isolated nucleic acid molecule encoding a mature 
polypeptide expressible by die Streptococcus pneumoniae 0100993 strain, which polypeptide is comprised in 
the deposited strain. Further provided by the invention are acpS polynucleotide sequences in the deposited 
strain, such as DNA and RNA, and amino acid sequences encoded thereby. Also provided by the invention are 
acpS poh'peptide and polynucleotide sequences isolated from the deposited strain. 



r 
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Polypeptides 

acpS polypeptide of the invention is substantially phylogenetically related to other proteins of the acyl 
carrier protein synthase family. 

In one aspect of the invaition there are provided polypeptides of Streptococcus pneumoniae referred 
5 to herein as "acpS" and "acpS polypeptides" as well as biologically, diagnostically, prophylactically, clinically 
or therapeutically usefiil variants thereof, and compositians comprising the same. 

Among the particularly preferred embodiments of the invention are variants of acpS polypeptide 
encoded by naturally occurring alleles of a acpS gene. 

The present invention further provides for an isolated polypeptide which; (a) comprises or consists of 
1 0 an ammo acid sequence which has at least 95% identity, most preferably at least 97-99% or exact 

identit>\ to that of SEQ ID N0:2 over the entire length of SEQ ID N0:2; (b) a polypeptide encoded by 
an isolated polynucleotide comprising or consisting of a polynucleotide sequence which has at least 95% 
identit\\ even more preferably at least 97-99% or exact identity to SEQ ID NO: 1 over the entire length of 
SEQ ID NO: 1 ; (c) a polypeptide encoded by an isolated polynucleotide comprising or consisting of a 
1 5 polynucleotide sequence encoding a polypeptide which has at least 95% identity, even more preferably at 
least 97-99% or exact identity, to the amino acid sequence of SEQ ID N0:2, over the entire length of SEQ 
ID /^0:2- 

The polypeptides of the invention include a polypeptide of Table 1 [SEQ ID N0:2] (in particular a 
mature polypeptide) as well as polypeptides and fragments, particularly those which have a biological activity 
20 of acpS, and also those which have at least 95% identity to a polypq)tide of Table 1 [SEQ ID N0:2] and also 
include portions of such polypeptides with such portion of the polypeptide generally conqjrising at least 30 
amino acids and more preferably at least 50 amino acids. 

The invention also includes a polypeptide consisting of or con:^)rising a polypeptide of the formula: 

x-(RiW{R2HR3)n-Y 

25 wherein, at the amino terminus, X is hydrogen, a metal or any other moiety described herein for modified 
polypeptides, and at the carboxyl terminus, Y is hydrpg^ a metal or any other moiety described herein for 
modified polypeptides, R] and R3 are any amino acid residue or modified ammo acid residue, m is an integer 
between 1 and 1000 or zero, n is an integer between 1 and 1000 or zero, and R2 is an amino acid sequence of 
the mvention, particularly an amino acid sequence selected fi-om Table 1 or modified forms thereof In the 

30 formula above, R2 is oriented so that its amino terminal amino acid residue is at the left, covalently bound to 
Rj and its carboxy tenninal amino acid residue is at the right covalortly bound to R3. Any stretch of amino 
acid residues denoted by either Rj or R3, where m and/or n is greater than 1, may be either a heteropolymer or 
a homopolymer, preferably a heteropolymer. Other preferred embodiments of the invention are provided vsiiere 
m is an integer between 1 and 50, 100 or 500, and n is an integer between 1 and 50, 100, or 500. 



-4- 
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It is most preferred that a polypeptide of the invention is derived from Streptococcus pneumoniae, 
however, it may preferably be obtained from other organisms of the same taxonomic genus. A polypeptide of 
the invention may also be obtained, for example, from organisms of the same taxonomic family or order. 

A fragment is a variant polypeptide having an amino acid sequence Aat is entirely the same as part but 
not all of any amino acid sequence of any polypqrtide of the invention. As with acpS polypeptides, fragments 
may be "free-standing," or comprised within a larger polypeptide of which they form a part or region, most 
preferably as a single continuous region in a single larger polypeptide. 

Preferred fragments include, for exan:5)le, truncation polypeptides having a portion of an amino acid 
sequence of Table 1 [SEQ DD N0:2], or of variants thereof, such as a continuous series of residues that 
includes an amino- and/or carboxyl-terminal amino add sequence. Degradation forms of the polypeptides of 
the invention produced by or in a host cell, particularly a Streptococcus pneumoniae, are also preferred. 
Further preferred are fragments characterized by structural or functional attributes such as fi^gmoits that 
comprise alpha-helix and alpha-helix forming regions, beta-sheet and beta-sheet-forming regions, turn and tum- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic 
regions, beta amphipathic regions, flexible regions, surfece-fonming regions, substrate binding region, and high 
antigenic index regions. 

Further preferred fragments include an isolated polypep^'.ie comprising an amino acid sequence 
havmg at least 15, 20, 30, 40, 50 or 100 contiguous amino acids from the amino acid sequence of SEQ 
ID N0:2, or an isolated polypeptide comprising an amino acid sequence having at least 15, 20, 30, 40, 
50 or 100 contiguous amino acids truncated or deleted from the amino acid sequence of SEQ ID N0:2. 

Fragments of the polypeptides of the invention may be anployed for producing the corresponding full- 
length polypeptide by peptide synthesis; therefore, these variants may be employed as intermediates for 
producing the full-length polypeptides of the invention. 

Polynucleotides 

It IS an object of the invention to provide polynucleotides that encode acpS polypq)tides, particulariy 
polynucleotides that encode a polypeptide herein designated acpS . 

In a particularly preferred embodiment of the invention the polynucleotide comprises a region encoding 
acpS polypeptides comprising a sequence set out in Table 1 [SEQ ID NO: 1] which includes a full laigth gene, 
or a variant thereof The Applicants believe that this full lengdi gene is essential to the growth and/or survival 
of an organism that possesses it, such as Streptococcus pneumoniae. 

As a further aspect of the invention there are provided isolated nucleic acid molecules encoding 
and/or expressing acpS polypeptides and polynucleotides, particularly Streptococcus pneumoniae acpS 
polypeptides and polynucleotides, including, for example, ur^jrocessed RNAs, ribozyme RNAs, mRNAs, 
cDNAs, genomic DNAs, B- and Z-DNAs. Further embodiments of the invmtion include biologically, 
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diagiiostically, prophylacticaliy, clinically or ther^euticaUy useful polynucleotides and polypeptides, and 
variants thereof, and compositions comprising the same. 

Another aspect of the invention relates to isolated polynucleotides, including at least one full length 
gene, that encodes a acpS polypeptide having a deduced amino acid sequaice of Table 1 [SEQ ID N0:2] and 
polynucleotides closely related thereto and variants thereof 

In another particularly preferred embodiment of the invention there is a acpS polypeptide from 
Streptococcus pneumoniae conqjrising or consisting of an amino acid sequence of Table 1 [SEQ ID 
N0:2], or a variant thereof 

Using the infomiation provided herein, such as a polynucleotide sequaice set out in Table 1 [SEQ ID 
NO: I], a polynucleotide of the invention encoding acpS polypeptide may be obtained using standard cloning 
and screening methods, such as those for cloning and sequencing chromosomal DNA ftagments from bacteria 
using Streptococcus pneumoniae 0100993 cells as starting material, followed by obtaining a full lengtii clone. 
For example, to obtain a polynucleotide sequence of the invention, such as a polynucleotide sequence 
given in Table 1 [SEQ ID N0:1], typically a library of clones of chromosomal DNA of Streptococcus 
pneumoniae 0100993 in Kcoli or some other suitable host is probed with a radiolabeled oligonucleotide, 
preferably a 17-mer or longer, derived from a partial sequence. Clones carrying DNA identical to that of 
the probe can then be distinguished using stringent hybridization conditions. By sequencing the individual 
clones thus identified by hybridization with sequencing primers designed from the original polypeptide or 
polynucleotide sequence it is then possible to extoid the polynucleotide sequence in both directions to 
determine a full length gene sequence. Conveniently, such sequencing is performed, for example, using 
denatured double stranded DNA prepared from a plasmid clone. Suitable techniques are described by 
Maniatis, T., Fritsch, E.F. and Sambrook et al,, MOLECULAR CLONING, A LABORATORY MANUAL, 
2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989). (see in particular 
Screening By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13.70), 
Direct genomic DNA sequencing may also be performed to obtain a full length gene sequence. Dhistrative 
of the invention, each polynucleotide set out in Table 1 [SEQ ID N0:1] was discovered in a DNA hbraiy 
derived from Streptococcus pneumoniae 0100993. 

Moreover, each DNA sequence set out in Table 1 [SEQ ID N0:1] contains an open reading frame 
encoding a protein having about the number of amino acid residues set forth in Table 1 [SEQ ID N0:2] with a 
deduced molecular weight tiiat can be calculated using amino acid residue molecular weight values well known 
to those skilled in the art. The poKracleotide of SEQ ID N0:1, between nucleotide number 1 and the stop 
codon which begias at nucleotide number 361 of SEQ ID N0:1, encodes the polypeptide of SEQ ID N0:2. 

In a further aspect, the presait invaition provides for an isolated polynucleotide comprising or 
consisting of (a) a polynucleotide sequence which has at least 95% identity, even more preferably at 
least 97-99% or exact identity to SEQ ID N0:1 over the entire length of SEQ ID N0:1, or the entire 



wo 99/61452 



PCTAJS99/11704 



length of that portion of SEQ ID N0:1 which encodes SEQ ID N0:2; (b) a polynucleotide sequence 
encoding a polypeptide which has at least 95% identity, even more preferably at least 97-99% or 100% 
exact, to the amino acid sequence of SEQ ID N0:2, over the entire length of SEQ ID N0:2. 

A polynucleotide encoding a polypeptide of the present invention, including homologs and orthologs 
from species other than Streptococcus pneumoniae, may be obtained by a process which comprises the steps of 
screening an appropriate library under stringent hybridization conditions with a labeled or detectable probe 
consistmg of or comprising the sequence of SEQ ID N0:1 or a fragment thereof; and isolating a full-length 
gene and/or genomic clones comprising said polynucleotide sequence. 

The invention provides a polynucleotide sequence identical over its entire length to a coding sequence 
(open reading frame) in Table 1 [SEQ ID N0:1]. Also provided by the invaition is a coding sequence for a 
mature polypeptide or a fragment thereof, by itself as well as a coding sequ^ce for a mature polypq)tide or a 
fragment m reading frame with another coding sequaice, such as a sequence encoding a leader or secretory 
sequence, a pre-, or pro- or prepro-protein sequaice. The polynucleotide of the invention may also comprise at 
least one nonnxiding sequence, including for example, but not limited to at least one non-coding 5' and 3' 
sequence, such as the transcribed but non-translated sequences, termination signals (such as rho-dependent and 
rho-independent termination signals), ribosome binding sites, Kozak sequences, sequences that stabilize mRNA, 
mtrons, and polyadenylation signals. The pc >fmcleotide sequence may also conqsrise additional coding 
sequence encoding additional amino acids. For exanq)le, a marker sequaice that fecilitates purification of a 
fiised polypeptide can be encoded. In certain embodiments of the invention, the marker sequoice is a hexa- 
histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and described in Gentz et al , Proc, Natl Acad 
Sci., USA 86: 821-824 (1989), or an HA peptide tag (Wilson et al. Cell 37: 767 (1984), bodi of which may be 
useful in purifying polypeptide sequaice fiised to them. Polynucleotides of the invention also include, but are 
not limited to, polynucleotides comprising a structural gene and its naturally associated sequences that control 
gene expression. 

A preferred embodiment of the invention is a polynucleotide of consisting of or conqjrising nucleotide 1 
to the nucleotide immediately upstream of or including nucleotide 361 set forth in SEQ ID N0:1 of Table 1, 
both of which encode a acpS polypeptide. 

The invention also includes a polynucleotide consisting of or comprising a polynucleotide of the 

formula: 

X-(Rl)m-(R2)-(R3)n-Y 

wherein, at the 5' end of the molecule, X is hydrogen, a metal or a modified nucleotide residue, or together 
with Y defines a covalent bond, and at the 3' end of the molecule, Y is hydrogen, a metal, or a modified 
nucleotide residue, or together with X defines the covalent bond, each occurrence of Ri and R3 is 
independently any nucleic acid residue or modified nucleic acid residue, m is an integer between 1 and 
3000 or zero , n is an integer between 1 and 3000 or zero, and R2 is a nucleic acid sequence or modified 
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nucleic acid sequence of the invention, particularly a nucleic acid sequence selected from Table 1 or a 
modified nucleic acid sequence thereof In the polynucleotide formula above, R2 is oriented so that its 5' 
end nucleic acid residue is at the left, bound to Rj^ and its 3' end nucleic acid residue is at the right, 
bound to R3. Any stretch of nucleic acid residues denoted by either Rj and/or R2, where m and/or n is 
5 greater than 1 , may be either a heteropolymer or a homopolymer, preferably a heteropolymer. Where, in 
a preferred embodiment, X and Y together define a covalent bond, the polynucleotide of the above 
formula is a closed, circular polynucleotide, which can be a double-stranded polynucleotide wherein the 
formula shows a first strand to which the second strand is complementary. In another preferred 
embodiment m and/or n is an integer between 1 and 1 000. Other preferred embodiments of the invention are 
1 0 provided where m is an integer between 1 and 50, 100 or 500, and n is an integer between 1 and 50, 100, or 
500. 

It is most preferred that a polynucleotide of the invoition is derived firom Streptococcus pneumoniae, 
however, it may preferably be obtained fi-om other oiganisms of the same taxonomic genus. A polynucleotide 
of the invention may also be obtained, for exan5)le, frasa organisms of the same taxonomic femiK' or order. 

1 5 The tenn "polynucleotide encoding a polypqjtide" as used herein encompasses polynucleotides that 

include a sequence encoding a polypeptide of the invention, particularly a bacterial polypeptide and more 
particularly a polypeptide of the Streptococcus pneumoniae acpS having an amino acid sequaice set out ; 
Table 1 [SEQ ID N0:2]. The term also encompasses polynucleotides that include a single continuous region or 
discontinuous regions encoding the polypeptide (for exan^Dle, polynucleotides interrupted by integrated phage, 

20 an integrated insertion sequence, an integrated vector sequence, an integrated transposon sequence, or due to 
RNA editing or genomic DNA reoiganizatian) together with additional r^ons, that also may comprise coding 
and/or non-coding sequences. 

The invention fiirther relates to variants of the polynucleotides described herein that encode variants of 
a polypeptide having a deduced amino acid sequoice of Table 1 [SEQ ID N0:2]. Fragments of polynucleotides 

25 of the invention may be used, for exanqDie, to synthesize fiill-length polynucleotides of the invention. 

Further particularly preferred anbodiments are polynucleotides encoding acpS variants, that have the 
amino acid sequence of acpS polypeptide of Table 1 [SEQ ID N0:2] in which several, a few, 5 to 10, 1 to 5, 1 
to 3, 2, 1 or no amino acid residues are substituted, modified, deleted and/or added, in any combination. 
Especially preferred among these are silent substitutions, additions and deletions, that do not alter the properties 

30 and activities of acpS polypeptide. 

Further preferred embodiments of the invention are polynucleotides that are at least 95% or 97% 
identical over their entire length to a polynucleotide aicoding acpS polypeptide having an amino acid sequence 
set out in Table 1 [SEQ ID N0:2], and polynucleotides that are conq)lementary to such polynucleotides. Most 
highly preferred are polynucleotides that con5)rise a r^on that is at least 95% are especially preferred. 

35 Furthermore, those with at least 97% are highly preferred among those with at least 95%, and among these 
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those with at least 98% and at least 99% are particularly highly preferred, with at least 99% being the more 
preferred. 

Preferred embodiments are polynucleotides encoding polypeptides that retain substantially the same 
biological function or activity as a mature polypeptide aicoded by a DNA of Table 1 [SEQ E) NO: 1]. 

In accordance with certain preferred embodimeats of this invaition tiiere are provided polynucleotides 
that hybridize, particularly under stringent conditions, to acpS polynucleotide sequences, such as those 
polynucleotides in Table 1. 

The invention fiuther relates to polynucleotides tiiat hybridize to the polynucleotide sequoices provided 
herein. In this r^d, the invention especially relates to polynucleotides that hybridize under stringent 
conditions to the polynucleotides described herein. As herein used, the terms "stringent conditions" and 
"stringent hybridization conditions" mean hybridization occurring only if there is at least 95% and prrferably at 
least 97% identity between the sequences. A specific example of stringent hybridization conditions is 
overnight incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150mM NaCl 15mM 
trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10% dextran sulfate, and 
15 20 micrograms/ml of denatured, sheared salmon sperm DNA, followed by washing the hybridization 
support in 0. Ix SSC at about 65°C. Hybridization and wash conditions are well known and exemplified 
m Sambrook, et ai. Molecular Cloning: A Laboratory Manual. Second Edition, Cold Spring Harbor, 
N.Y., (1989), particularly Chapter 11 therein. Solution hybridization may also be used with the 
polynucleotide sequences provided by the mvention. 
20 The invention also provides a polynucleotide consisting of or comprising a polynucleotide 

sequence obtained by screening an appropriate Ubraiy comprising a complete gene for a polynucleotide 
sequence set forth m SEQ ID N0:1 under stringent hybridization conditions with a probe having the 
sequence of said polynucleotide sequence set forth in SEQ ID N0:1 or a fragment thereof; and isolating 
said polynucleotide sequence. Fragments useful for obtaining such a polynucleotide include, for example. 
25 probes and primers fully described elsewhere herein. 

As discussed elsewhere herein regarding polynucleotide assays of the invention, for instance, the 
polynucleotides of the invention, may be used as a hybridization probe for RNA, cDNA and genomic DNA to 
isolate full-length cDNAs and genomic clones encoding acpS and to isolate cDNA and genomic clones of other 
genes that have a high identity, particularly high sequence identity, to a acpS gene. Such probes generally will 
comprise at least 1 5 nucleotide residues or base pairs. Preferably, such probes wiU have at least 30 nucleotide 
residues or base pairs and may have at least 50 nucleotide residues or base pairs. Particularly preferred probes 
will have at least 20 nucleotide residues or base pairs and will have lee than 30 nucleotide residues or base 
pairs. 

A coding region of a acpS gene may be isolated by screening using a DNA sequence provided in Table 
1 [SEQ ID N0:1] to synthesize an oligonucleotide probe. A labeled oUgonucleotide having a sequence 
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complementary to that of a gene of the invention is thai used to screen a libran- of cDNA, genomic DNA or 
mRNA to determine which members of the libraiy the probe hybridizes to. 

There are several methods available and well known to those skilled in the art to obtain full- 
length DNAs, or extend shon DNAs, for example those based on the method of Rapid Amplification of 
cDNA ends (RACE) (see, for example, Frohman, etaL, PNAS USA 85: 8998-9002, 1988). Recent 
modifications of the technique, exemplified by the Marathon*™ technology (Clontech Laboratories Inc.) 
for example, have significantly simplified the search for longer cDNAs. In the Marathon*™ technology, 
cDNAs have been prepared from mRNA extracted firom a chosen tissue and an 'adaptor' sequence ligated 
onto each end. Nucleic acid amplification (PCR) is then carried out to amplify the "missing" 5' end of the 
DNA using a combination of gene specific and ad^tor specific oligonucleotide primers. The PCR 
reaction is then repeated using "nested" primers, that is, primers designed to anneal within the aitq)lified 
product (typically an adaptor specific primer that anneals further 3' in the adaptor sequence and a gene 
specific primer that anneals further 5' in the selected gene sequence). The products of this reaction can 
then be analyzed by DNA sequencing and a full-length DNA constructed either by joining the product 
directly to the existing DNA to give a complete sequence, or carrying out a separate full-length PCR 
using the new sequence information for the design of the 5' primer. 

The polynucleotides and polypeptides of the invention may be employed, for exa^ \ple, as research 
reagents and materials for discovery of treatments of and diagnostics for diseases, particularly human diseases, 
as further discussed herein relating to polynucleotide assays. 

The polynucleotides of the invention that are oligonucleotides derived from a sequence of Table 1 
[SEQ ID N0S:1 or 2] may be used in the processes herein as described, but preferably for PCR, to 
determine whether or not the polynucleotides identified herein in whole or in part are transcribed in 
bacteria in infected tissue. It is recognized that such sequences will also have utility in diagnosis of the 
stage of infection and type of infection the pathogen has attained. 

The invention also provides polynucleotides that oicode a polypeptide that is a mature protein phis 
additional amino or carboxyl-terminal amino acids, or amino acids interior to a mature polypeptide (when a 
mature form has more than one polypeptide diain, for instance). Such sequaices may play a role in processing 
of a protem from precursor to a mature form, may allow protein transport may lengthen or shorten protein half- 
life or may facUitate manipulation of a protein for assay or production, among other things. As generally is the 
case in vivo, the additional amino acids may be processed away from a mature protein by cellular ^izymes. 

For each and evers' polynucleotide of the invention there is provided a polynucleotide complanenlary to 
It. h is preferred that these complementan' polynucleotides are fiilly complementary to each polynucleotide 
with which they are complem^tary. 

A precursor protein, having a mature form of the polypeptide fiised to one or more prosequences may 
be an inactive fonn of the polypeptide. When prosequoices are removed such inactive precursors generally are 
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activated. Some or all of the prosequences may be removed before activation- Generally, such precursors are 
called proproteins. 

In sum, a polynucleotide of the invention may encode a mature protein, a mature protein plus a leader 
sequence (which may be referred to as a preprotein), a precursor of a mature protein having one or more 
prosequences that are not the leader sequences of a preprotein, or a preproprotein, \^4iich is a precursor to a 
proprotein, having a leader sequence and one or more prosequences, which goierally are removed during 
processing steps that produce active and mature forms of the polypeptide. 

Vectors, Host Cells, Expression Systems 

The invention also relates to vectors that comprise a polynucleotide or polynucleotides of the invention, 
host cells that are genetically engineered with vectors of the invoition and the production of polypeptides of the 
invention by recombinant techniques. Cell-free translation systems can also be employed to produce such 
proteins using RNAs derived from the DNA constructs of the invention. 

Recombinant polypeptides of the present invention may be prepared by processes well known in those 
skilled in the art from genetically engineered host cells comprising e?q)ression systems. Accordingly, in a 
fiirther aspect, the present invention relates to expression systems v^ch comprise a polynucleotide or 
polynucleotides of the present invention, to host cells vAdch big genetically engineered with such expression 
systems, and tc the production of polypeptides of the invention by recombinant techniques. 

For recombinant production of the polypq)tides of the invoition, host cells can be genetically 
engineered to incorporate expression systems or portions thereof or polynucleotides of the invention. 
Introduction of a polynucleotide into the host cell can be effected by methods described in many standard 
laboratory manuals, such as Davis, et aL, BASIC METHODS IN MOLECULAR BIOLOGY, (1986) and 
Sambrook, et aL, MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium phosphate transfection, DEAE-de?ctran 
mediated transfection, transvection, microinjection, cationic lipid-mediated transfection, electroporation, 
transduction, scrape loading, baUistic introduction and mfection. 

Representative examples of appropriate hosts include bacterial cells, such as cells of streptococci, 
staphylococci, enterococci E. coli, streptomyces, cyanobacteria. Bacillus subtilis, and Streptococcus 
pneumoniae, fimgal cells, such as cells of a yeast, Kluveromyces, Saccharomyces, a basidiomycete, Candida 
albicans and Aspergillus: insect cells such as cells of Drosophila S2 and Spodoptera Sf9; animal cells such as 
CHO, COS, HeLa, C 1 27, 3T3, BHK, 293, CV-1 and Bowes melanoma cells; and plant cells, such as ceUs of a 
gymnosperm or angiosperm. 

A great variet>^ of expression systems can be used to produce the polypeptides of the invention. Such 
vectors include, among otiiers, chromosomal-, episomal- and virusKlerived vectors, for exanqDle, vectors derived 
from bacterial plasmids, from bacteriophage, frtm transposons, from yeast episomes, from insertion elements, 
from yeast chromosomal elemoits, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
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viruses, adenoviruses, fowl pox viruses, pseudorabies viruses, picomaviruses and retroviruses, and vectors 
derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, 
such as cosmids and phaganids. The expression system constructs may comprise control regions that regulate 
as well as engender expression. Generally, any system or vector suitable to maintain, propagate or express 
5 polynucleotides and/or to express a polypeptide in a host may be used for ejqsression in this regard. The 
^propnate DNA sequence may be inserted into the ejqjressicn system by any of a variety of well-known and 
routine techniques, such as, for example, those set forth in Sambrook et al., MOLECULAR CLONING, A 
LABORATORY MANUAL, (supra). 

In recombinant expression systems in eukaiyotes, for secretion of a translated protein into the lumen of 

10 the endoplasmic reticulum, into the periplasmic space or into the extracellular environment, appropriate 
secretion signals may be incorporated into the e?q)ressed polypeptide. These signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

Polypeptides of the invention can be recovered and purified from recombinant cell cultures by well- 
known methods including ammonium sulfite or ethanol precipitation, acid extraction, anion or cation exchange 

1 5 chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography, and lectin chromatography. Most preferably, high 
performance liquid chromatography is employed for purification. Well knov.v. techniques for refolding protein 
may be employed to regenerate active conformation when the polypqrtide is denatured during isolation and or 
purification. 

20 Diagnostic, Prognostic, Serotyping and Mutation Assays 

This invention is also related to the use of acpS polynucleotides and polypeptides of the invention for 
use as diagnostic reagents. Detection of acpS polynucleotides and/or polypeptides in a eukaryote, particularly a 
mammal, and especially a himian, will provide a diagnostic method for diagnosis of disease, staging of disease 
or response of an infectious organism to drugs. Eukaryotes, particularly mammals, and especially humans, 

25 particularly those infected or suspected to be infected with an organism comprising the acpS gene or protein, 
may be detected at the nucleic acid or amino acid level by a variety of well known tediniques as well as by 
methods provided herein. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be obtained from a 
putatively infected and/or infected individual's bodily materials. Polynucleotides from any of these sources, 

3 0 particularly DNA or RN A, may be used directly for detection or may be ampUfied enzymatically by using PGR 
or any other anq)lification technique prior to analj'sis. RNA, particularly mRNA, cDNA and genomic DNA 
may also be used in the same ways. Using amplification, characterization of the species and strain of infectious 
or resident organism present in an individual, may be made by an analysis of the genotype of a selected 
polynucleotide of the organism. Deletions and insertions can be detected by a change in size of the amplified 

35 product in comparison to a gaiotype of a referoice sequence selected from a related organism, preferably a 
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different species of the same genus or a difierent strain of the same species. Point mutations can be identified 
by hybridizing amplified DNA to labeled acpS polynucleotide sequaices. Perfectl\* or significantly matched 
sequences can be distinguished fi-om inqjerfectly or more significantly mismatched duplexes by DNase or 
RNase digestion, for DNA or RNA respectively, or by detecting differences in melting temperatures or 
renaturation kinetics. Polynucleotide sequaice diflEerences may also be detected by alterations in the 
electrophoretic mobility of polynucleotide firagments in gels as compared to a reference sequaice. This may be 
carried out with or without denaturing agents. Polynucleotide diflFerences may also be detected by direct DNA 
or RNA sequencing. See, for example, Myers et a/., Science, 230: 1242 (1985). Sequence changes at specific 
locations also may be revealed by nuclease protection assays, such as RNase, VI and S 1 protection assay or a 
chemical cleavage method. See, for example. Cotton et al, Proc. Natl Acad Set, USA, 85: 4397-4401 
(1985). 

In another embodiment an array of oligonucleotides probes comprising acpS nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of, for exan^le, genetic mutations, serotype, 
taxonomic classification or identification. Array technology methods are well known and have general 
applicability and can be used to address a variety of questions in molecular genetics including gene expression, 
genetic linkage, and genetic variability (see, for example, Chee et al. Science, 274: 610 (1996)). 

Thus in another aspect, the present invention relates to a diagnostic kit which comprises: 
(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ ID N0:1, or a 
fragment thereof ; (b) a nucleotide sequence complementary to that of (a); (c) a polypeptide of the present 
invention, preferably the polypeptide of SEQ ID N0:2 or a fi-agment thereof: or (d) an antibody to a 
polypeptide of the present invention, preferably to the polypeptide of SEQ ID N0:2. It will be 
appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial component. Such a kit 
will be of use in diagnosing a disease or susceptibility to a Disease, among others. 

This invention also relates to the use of polynucleotides of the present invention as diagnostic reagents. 
Detection of a mutated form of a polynucleotide of the invaition, preferable, SEQ ID NO: 1, vtoch is associated 
with a disease or pathogenicity will provide a diagnostic tool that can add to, or define, a diagnosis of a disease, 
a prognosis of a course of disease, a detennination of a stage of disease, or a susceptibility to a disease, wdiich 
results fi-om under-expression, over-expression or altered expression of the polynucleotide. Organisms, 
particulariy infectious organisms, carrying mutations in such polynucleotide may be detected at the 
polynucleotide level by a variety of techniques, such as those described elsewhere herein. 

The differences in a polynucleotide and/or polypeptide sequence between organisms possessing a 
first phenot>T)e and organisms possessing a different, second dififerent phenotype can also be determined. 
If a mutation is observed in some or all organisms possessing the first phenotype but not in any organisms 
possessing the second phenotype, then the mutation is likely to be the causative agent of the first 
phenotype. 
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Cells from an organism carrying mutations or polymoiphisms (allelic variations) in a polynucleotide 
and/or pol>peptide of the invention may also be detected at the polynucleotide or polypeptide level by a varietv- 
of techniques, to allow for serotyping, for example. For example, RT-PCR can be used to detea mutations in 
the RNA. It is particularly preferred to use RT-PCR in conjunction with automated detection systems, such as, 
for example, GeneScan. RNA, cDNA or genomic DNA may also be used for the same purpose, PCR. As an 
example, PCR primers complementary to a polynucleotide encoding acpS polypeptide can be used to identify 
and analyze mutations. The invention further provides these primers with 1 , 2, 3 or 4 nucleotides removed from 
the 5' and/or the 3* end. These primers may be used for, among other things, an:5)Iifying acpS DNA and/or 
RNA isolated from a sample derived from an individual, such as a bodily material. The primers may be used to 
amplify a polynucleotide isolated from an infected individual, such that the polynucleotide may then be subject 
to various techniques for elucidation of the polynucleotide sequence. In this way, mutations in the 
polynucleotide sequence may be detected and used to diagnose and/or prognose the infection or its stage or 
course, or to serotype and/or classify the infectious agent. 

The invention further provides a process for diagnosing, disease, preferably bacterial infections, 
more preferably infections caused by Streptococcus pneumoniae, comprising determining from a sample 
derived from an individual, such as a bodily material, an increased level of expression of polynucleotide 
having a sequence of Table 1 [SEQ ID N0:1]. V/icreased or decreased expression of a acpS 
polynucleotide can be measured using any on of the methods well known in the art for the quantitation of 
polynucleotides, such as, for example, amplification, PCR, RT-PCR, RNase protection. Northern 
blotting, spectrometry and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over-e?q)ressiQn of acpS 
polypeptide compared to nonnal control tissue saxaples may be used to detect the presence of an infection, for 
example. Assay techniques that can be used to d^emune levels of a acpS polypeptide, in a sanq)le derived 
from a host, such as a bodily material, are well-known to those of skill in the art. Such assay methods inchide 
radioimmunoassays, competitive-binding assays, Western Blot analysis, antibody sandwich assays, antibody 
detection and ELISA assays. 

Antagonists and Agonists - Assays and Molecules 

Polypeptides and polynucleotides of the invention may also be used to assess the binding of small 
molecule substrates and ligands in, for example, cells, ceU-free preparations, chemical Ubraries, and natural 
product mixtures. These substrates and hgands may be natural substrates and ligands or may be structural or 
functional mimetics. See, e.g., Coligan et al. Current Protocols in Immunology 1(2): Chapter 5 (1991). 

Polypeptides and polynucleotides of the present invention are responsible for many biological 
functions, uicluding many disease states, in particular the Diseases herein mentioned. It is therefore desirable to 
devise screaiing methods to identify compounds v*idi stimulate or which inhibit the fiinction of the polypeptide 
or polynucleotide. Accordingly, in a fiirther aspect, the present invaition provides for a method of screening 
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compounds to identify those which stimulate or whidi inhibit the function of a polypqrtide or polynucleotide of 
the invention, as well as related polypeptides and polynucleotides. In general, agonists or antagonists {e.g., 
inhibitors) may be employed for therapeutic and prophylactic purposes for such Diseases as herein mentioned. 
Compounds may be identified fi-om a variety of sources, for example, cells, cell-firee preparations, chemical 
libraries, and natural product mixtures. Such agonists and antagonists so-identified may be natural or modified 
substrates, ligands, receptors, enzymes, etc., as the case may be, of acpS polypeptides and polynucleotides; or 
may be structural or functional mimetics thereof (see Coligan et al , Current Protocols in Immunology 
l(2):Chapter5(1991)). 

The screening methods may simply measure the binding of a candidate compound to the 
polypeptide or polynucleotide, or to cells or membranes bearing the polypeptide or polynucleotide, or a 
fusion protein of the polypeptide by means of a label directly or indirectly associated with the candidate 
compound. Alternatively, the screening method may involve competition with a labeled competitor. 
Further, these screening methods may test whether the candidate compound results in a signal generated 
by activation or inhibition of the polypeptide or polynucleotide, using detection systems appropriate to the 
cells comprising the polypeptide or polynucleotide. Inhibitors of activation are generally assayed in the 
presence of a known agonist and the effect on activation by the agonist by the presence of the candidate 
compound is observed. Constitutively active polypeptide and/or constitutively expressed polypeptides 
and polynucleotides may be employed in screening methods for inverse agonists, in the absence of an 
agonist or antagonist, by testing whether the candidate compound results in inhibition of activation of the 
polypeptide or polynucleotide, as the case may be. Further, the screening methods may simply comprise 
the steps of mixing a candidate compound with a solution comprising a polypeptide or polynucleotide of 
the present invention, to form a mixture, measuring acpS polypeptide and/or polynucleotide activity in the 
mixture, and comparing the acpS polypeptide and/or polynucleotide activity of the mixture to a standard. 
Fusion proteins, such as those made fi-om Fc portion and acpS polypeptide, as herein described, can also 
be used for high-throughput screening assays to identify antagonists of the polypeptide of the present 
invention, as well as of phylogenetically and and/or fimctionally related polypeptides (see D, Bennett et 
al, J Mol Recognition, 8:52-58 (1995); and K. Johanson et al., J Biol Chem, 270(1 6):9459-9471 
(1995)). 

The polynucleotides, polypeptides and antibodies that bind to and/or interact with a polypeptide of 
the present invention may also be used to configure screening methods for detecting the effect of added 
compounds on the production of mRNA and/or polypeptide in cells. For example, an ELISA assay may 
be constructed for measuring secreted or cell associated levels of polypeptide using monoclonal and 
polyclonal antibodies by standard methods known in the art. This can be used to discover agents which 
may mhibit or enhance tiie production of polypeptide (also called antagonist or agonist, respectively) from 
suitably manipulated cells or tissues. 
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The invention also provides a method of screaiing compounds to identify those which enhance 
(agonist) or block (antagonist) the action of acpS polypeptides or polynucleotides, particularly those compounds 
that are bacteristatic and/or bactericidal. The method of screening may involve high-throughput techniques. 
For example, to screen for agonists or antagonists, a synthetic reaction mix, a cellular compartment, such as a 
membrane, cell envelojje or ceU wall, or a preparation of any thereof, comprising acpS polypeptide and a 
labeled substrate or hgand of such polypq)tide is incubated in the absence or the presence of a candidate 
molecule that may be a acpS agonist or antagonist. The ability of tfie candidate molecule to agonize or 
antagonize the acpS polypeptide is reflected in decreased binding of the labeled ligand or decreased production 
of product from such substrate. Molecules that bind gratuitously, /.e., without inducing the eflfects of acpS 
polypeptide are most likely to be good antagonists. Molecules tiiat bind well and, as the case may be, increase 
the rate of product production from substrate, increase signal transduction, or increase chemical channel activity 
are agonists. Detection of the rate or level of, as the case may be, production of product from substrate, signal 
transduction, or chemical channel activity may be enhanced by using a reporter system. Reporter systems that 
may be useful in this regard include but are not limited to colorimetric, labeled substrate converted into product, 
a reporter gene that is responsive to changes in acpS polynucleotide or polypeptide activity, and binding assays 
known in the art. 

Polypeptides of the invention may i ; j used to identify membrane bound or soluble receptors, if 
any, for such polypeptide, through standard receptor binding techniques known in the art. These 
techmques include, but are not limited to, Ugand binding and crosslinking assays in which the 
polypeptide is labeled with a radioactive isotope (for instance, ^^^T), chemically modified (for instance, 
biotinylated), or fused to a peptide sequence suitable for detection or purification, and incubated with a 
source of the putative receptor (e.g., cells, cell membranes, cell supematants, tissue extracts, bodily 
materials). Other methods include biophysical techniques such as surface plasmon resonance and 
spectroscopy. These screening methods may also be used to identify agonists and antagonists of the 
polypeptide which compete with die binding of the polypeptide to its receptor(s), if any. Standard 
methods for conducting such assays are well understood in the art. 

The fluorescence polarization value for a fluorescently-tagged molecule depends on the 
rotational correlation time or tumbling rate. Protein complexes, such as formed by acpS polypeptide 
associating with another acpS polypeptide or other polypeptide, labeled to comprise a fluorescently- 
labeled molecule will have higher polarization values than a fluorescently labeled monomeric protein. It 
is preferred that this method be used to characterize small molecules that disrupt polypeptide 
complexes. 

Fluorescence energy transfer may also be used characterize small molecules that interfere with 
the formation of acpS polypeptide dimers, trimers, tetramers or higher order structures, or structures 
formed by acpS polypeptide bound to another polypeptide. acpS polypeptide can be labeled with both 
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a donor and acceptor fluorophore. Upon mixing of the two labeled species and excitation of the donor 
fluorophore, fluorescence energy transfer can be detected by observing fluorescence of the acceptor. 
Compounds that block dimerization will inhibit fluorescence energy transfer. 

Surface plasmon resonance can be used to monitor the effect of small molecules on acpS 
polypeptide self-association as well as an association of acpS polypeptide and another polypeptide or 
small molecule. acpS polypeptide can be coupled to a sensor chip at low site density such that 
covalently bound molecules will be monomeric. Solution protein can then passed over the acpS 
polypeptide -coated sxirface and specific binding can be detected in real-time by monitoring the change 
in resonance angle caused by a change in local refractive index. This technique can be used to 
characterize the effect of small molecules on kinetic rates and equilibrium binding constants for acpS 
polypeptide self-association as well as an association of acpS polypeptide and another polypeptide or 
small molecule. 

A scintillation proximity assay may be used to characterize the interaction between an 
association of acpS polypeptide with another acpS polypeptide or a different polypeptide . acpS 
polypeptide can be coupled to a scintillation-filled bead. Addition of radio-labeled acpS polypeptide 
results in binding where the radioactive source molecule is in close proximity to the scintillation fluid. 
Thus, signal is emitted upon acpS polypeptide binding and compoxmds that prevent acpS polypeptide 
self-association or an association of acpS polypeptide and another polypeptide or small molecule will 
diminish signal. 

In odier embodiments of the invaition there are provided methods for identifying compounds which 
bind to or otherwise interact with and inhibit or activate an activity or expression of a polypqjtide and/or 
polynucleotide of the inv^tion comprising: contacting a polypeptide and/or polynucleotide of the invention 
with a compound to be screened under conditions to pemiit binding to or other interaction between the 
compound and the polypeptide and/or polynucleotide to assess the binding to or other interaction with the 
compound, such binding or interaction prefisrably being associated with a second conqjonent capable of 
providing a detectable signal in response to the binding or interaction of the polypeptide and/or polynucleotide 
with the compound; and determining whether the conqjound binds to or otherwise interacts with and activates 
or inhibits an activity or expression of the polypeptide and/or polynucleotide by detecting the presence or 
absence of a signal generated firom the binding or interaction of the conqjound with the polypqjtide and/or 
polynucleotide. 

Another example of an assay for acpS agonists is a competitive assay that combines acpS and a 
potential agonist with acpS-binding molecules, recombinant acpS binding molecules, natural substrates or 
ligands, or substrate or ligand mimetics, under appropriate conditions for a conqjetitive inhibition assay. acpS 
can be labeled, such as by radioactivity or a colorimetric compound, such that the number of acpS molecules 
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bound to a binding molecule or converted to product can be detennined accurately to assess the eflfectiveness of 
the potential antagonist. 

It will be readily appreciated by the skiUed artisan that a polypeptide and/or polynucleotide of the 
present invention may also be used in a method for the structure-based design of an agonist or antagonist 
of the polypeptide and/or polynucleotide, by: (a) determining in the first instance the three-dimensional 
structure of the polypeptide and/or polynucleotide, or complexes thereof; (b) deducing the three- 
dimensional structure for the likely reactive site(s), binding site(s) or motif(s) of an agonist or antagonist; 

(c) synthesizing candidate compounds that are predicted to bind to or react with the deduced binding 
site(s), reactive site(s), and/or motif(s); and 

(d) testing whether the candidate compounds are indeed agonists or antagonists. 

It will be further appreciated that this will normally be an iterative process, and this iterative process may 
be performed using automated and conq)uter-controlled steps. 

In a fiuther aspect the presort invoition provides methods of treating abnonnal conditions such as, for 
instance, a Disease, related to either an excess of, an under-expression of, an elevated activity of, or a decreased 
activity of acpS polypeptide and/or polynucleotide. 

If the expression and/or activity of the polypeptide and/or polynucleotide is in excess, several 
approaches are available. One approach comprises administering to an individual in need thereof an inhibitor 
compound (antagonist) as herein described, optionally in combination with a phannaceutically acceptable 
carrier, in an amount effective to inhibit the function and/or e?q)ressian of the polypeptide and/or polynucleotide, 
such as, for example, by blocking the binding of ligands, substrates, receptors, enzymes, etc., or by inhibiting a 
second signal, and thereby alleviating the abnonnal condition. In another approach, soluble forms of the 
polypeptides still capable of binding the ligand, substrate, enzymes, receptors, etc. m competition with 
endogenous polypeptide and/or polynucleotide may be administered. Typical examples of such competitors 
include fragments of the acpS polypeptide and/or polypeptide. 

In still another approach, expression of the gene encoding endogenous acpS polypeptide can be 
inhibited using expression blocking techniques. This blocking may be targeted against any step in gene 
expression, but is preferably targeted against transcription and/or translation. An examples of a known 
technique of this sort involve die use of antisense sequences, either intemaUy generated or separately 
administered (see, for example, O'Connor, JNeurochem (1991) 56:560 in Ohgodeoxynucleotides as 
Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Ahematively, 
oligonucleotides which form triple helices with die gene can be supplied (see, for example, Lee ei aL, 
Nucleic Acids Res (1979) 6:3073; CoontyetaL Science (1988) 241:456; Dervane/^/.. Science (1991) 
25 1 : 1360). These oligomers can be administered per se or die relevant oligomers can be expressed in 
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Each of the polynucleotide sequences provided herein may be used in the discoven^ and 
development of antibacterial compounds. The encoded protein, upon expression, can be used as a target 
for the screening of antibacterial drugs. Additionally, the polynucleotide sequences encoding the amino 
terminal regions of the encoded protein or Shine-Delgamo or other translation facilitating sequences of 
the respective mRNA can be used to construct antisense sequences to control the expression of the coding 
sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide, agonist or antagonist of 
the invention to interfere with the initial physical interaction between a pathogen or pathogens and a 
eukaryotic, preferably mammalian, host responsible for sequelae of infection. In particiilar, the molecules 
of the invention may be used: in the prevention of adhesion of bacteria, in particular gram positive and/or 
gram negative bacteria, to eukaryotic, preferably mammalian, extracellular matrix proteins on in-dwelling 
devices or to extracellular matrix proteins in wounds; to block bacterial adhesion between exxkaryotic, 
preferably mammalian, extracellular matrix proteins and bacterial acpS proteins that mediate tissue 
damage and/or; to block the normal progression of pathogenesis in infections initiated other than by the 
implantation of in-dwelling devices or by other surgical techniques. 

In accordance with yet another aspect of the invention, there are provided acpS agonists and 
antagonists, preferably bacteristatic or bactericidal agonists and antagonists. 

The antagonists and agonists of the invention may be onployed, for instance, to prevent, inhibit and/or 
treat diseases. 

Helicobacter pylori (herein "i/. pylori*') bacteria infect the stomachs of over one-third of the 
worid's population causing stomach cancer, ulcers, and gastritis (International Agency for Research on 
Cancer (1994) Schistosomes, Liver Flukes and Helicobacter Pylori (International Agency for Research 
on Cancer, Lyon, France, http://www.uiccxh/ecp/ecp2904.htm). Moreover, the International Agency for 
Research on Cancer recently recognized a cause-and-efiFect relationship between H pylori and gastric 
adenocarcinoma, classifying the bacterium as a Group I (definite) carcinogen. Preferred antimicrobial 
compounds of the invention (agonists and antagonists of acpS polypeptides and/or polynucleotides) found 
using screens provided by the invention, or known in the art, particularly narrow-spectrum antibiotics, 
should be useful in the treatment of H. pylori infection. Such treatment should decrease the advent of H 
/?v/or/-induced cancers, such as gastrointestinal carcinoma. Such treatment should also prevent, inhibit 
and/or cure gastric ulcers and gastritis. 

All publications and references, including but not limited to patents and patent applications, cited 
in this specification are herein incorporated by reference in their entirety as if each individual publication 
or reference were specifically and individually indicated to be incorporated by reference herein as being 
fully set forth. Any patent application to which this appUcation claims priority is also incorporated by 
reference herein in its entirety in the manner described above for publications and references. 
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GLOSSARY 

The following definitions are provided to facilitate understanding of certain tenns used frequently 

herein. 

"Bodily material(s) means any material derived from an individual or from an organism infecting, 
infesting or inhabiting an individual, including but not limited to, cells, tissues and waste, such as, bone, blood, 
serum, cerebrospinal fluid, semen, saliva, muscle, cartilage, organ tissue, skin, urine, stool or autopsy 
materials.. 

"Disease(s)" means any disease caused by or related to infection by a bacteria, including , for example, 
otitis media, conjunctivitis, pneumonia, bacteremia, moiingitis, sinusitis, pleural empyema and endocarditis, 
and most particularly meningitis, such as for example infection of cerebrospinal fluid. 

"Host cell(s)" is a cell which has been transformed or transfected, or is capable of transformation 
or transfection by an exogenous polynucleotide sequence. 

"Identit>'," as known in the art, is a relationshq) betwe^ two or more polypeptide sequences or two or 
more polynucleotide sequences, as the case may be, as determined by confq}aring the sequences. In the art, 
"identity" also means the degree of sequence relatedness between polypeptide or polynucleotide sequences, 
as the case may be, as determined by the match between strings of such sequences. "Identity" can be 
readily calculated by known methods, including but not limited to those described in {Computational 
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1 988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed.. Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 
1994; Sequence Analysis in Molecular Biology, vonHeinje, G., Academic Press, 1987; dnd Sequence 
Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, 
H., and Lipman, D., SIAM J, Applied Math,. 48: 1073 (1988). Methods to determine identity are 
designed to give the largest match between the sequences tested. Moreover, methods to determine identity 
are codified in publicly available computer programs. Conq)uter program methods to determine identity 
between two sequences include, but are not limited to, the GCG program package (Devereux, J., et al.. 
Nucleic Acids Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S.F. et al., 1 
Molec. Biol 215: 403-410 (1990). The BLAST X program is publicly available from NCBI and other 
sources {BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al, 
1 Mol Biol 215: 403-410 (1990). The well known Smith Waterman algorithm may also b^ used to 
determine identity. 

Parameters for polypeptide sequence comparison include the following: Algorithm: Needleman 
and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
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Comparison matrix: BLOSSUM62 from HentikofFand Hentikoff, Proc. Natl. Acad. Sci. USA. 
89:10915.10919(1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

A program useful with these parameters is publicly available as the "gap" program from Genetics 
Computer Group, Madison WI. The aforementioned parameters are the default parameters for peptide 
comparisons (along with no penalty for end g^s). 

Parameters for polynucleotide comparison include the following: Algorithm: Needleman and 
Wunsch, J. Mol Biol. 48: 443^53 (1970) 
Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

Available as: The "gap" program from Genetics Computer Group, Madison WI. These are the defauh 
parameters for nucleic acid comparisons. 

A preferred meaning for "identity" for polynucleotides and polypeptides, as the case may be, are 
provided in (1) and (2) below. 

( 1 ) Polynucleotide embodiments further include an isolate^^ polynucleotide comprising a 
polynucleotide sequence having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 100% identity to the reference 
sequence of SEQ ID NO: 1, wherein said polynucleotide sequence may be identical to the reference 
sequence of SEQ ID NO: 1 or may include up to a certain integer number of nucleotide alterations as 
compared to the reference sequence, wherein said alterations are selected from the group consisting of at 
least one nucleotide deletion, substitution, including transition and transversion, or insertion, and wherein 
said alterations may occur at the 5' or 3' tenninal positions of the reference nucleotide sequence or 
anywhere between those terminal positions, interspersed either individually among the nucleotides in the 
reference sequence or in one or more contiguous groups within the reference sequence, and wherein said 
number of nucleotide alterations is determined by multiplying the total number of nucleotides in SEQ ID 
NO: 1 by the integer defining the percent identity divided by 1 00 and then subtracting that product from 
said total number of nucleotides in SEQ ID N0:1, or: 

"n ^ " (Xj, . y), 

wherein is the number of nucleotide alterations, is the total number of nucleotides in SEQ ID 
N0:L y is 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for the multiplication 
operator, and wherein any non-integer product of x^ and y is rounded down to the nearest integer prior to 
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subtracting it from x^. Alterations of a polynucleotide sequence encoding the polypeptide of SEQ ID 
NO: 2 may create nonsense, missense or frameshift mutations in this coding sequence and thereby alter the 
polypeptide encoded by the polynucleotide following such alterations. 

By way of example, a polynucleotide sequence of the present invention may be identical to the 
reference sequence of SEQ ID NO: 1 , that is it may be 100% identical, or it may include up to a certain 
integer number of nucleic acid alterations as conapared to the reference sequence such that the percent 
identity is less than 100% identity. Such alterations are selected from the group consisting of at least one 
nucleic acid deletion, substitution, including transition and transversion, or insertion, and wherein said 
alterations may occur at the 5' or 3' terminal positions of the reference polynucleotide sequence or 
anywhere between those terminal positions, interspersed either individually among the nucleic acids in the 
reference sequence or in one or more contiguous groups within the reference sequence. The number of 
nucleic acid alterations for a given percent identity is determined by multiplying the total number of 
nucleic acids in SEQ ID NO: 1 by the integer defining the percent identity divided by 100 and then 
subtracting that product from said total number of nucleic acids in SEQ ID NO: 1 , or: 

^ >^n - (Xn • Vl 

wherein is the number of nucleic acid alterations, is the total number of nucleic acids in SEQ ID 
NO: 1 , y is, for instance 0.95 for 95%, 0.97 for 97% or 1 .00 for 1 00%, etc., • is the symbol for the 
multiplication operator, and wherein any non-integer product of Xjj and y is roimded down to the nearest 
integer prior to subtracting it from x,^. 

(2) Polypeptide embodiments ftirther include an isolated polypeptide comprising a 
polypeptide having at least a 95, 97 or 100% identity to a polypeptide reference sequence of SEQ ID 
N0:2, wherein said polypeptide sequence may be identical to the reference sequence of SEQ ID N0:2 or 
may include up to a certain integer number of amino acid alterations as compared to the reference 
sequence, wherein said alterations are selected from the group consisting of at least one amino acid 
deletion, substitution, including conservative and non-conservative substitution, or insertion, and wherein 
said alterations may occur at the amino- or carboxy-terminal positions of the reference polypeptide 
sequence or anywhere between those terminal positions, interspersed either individually among the amino 
acids in the reference sequence or in one or more contiguous groups within the reference sequence, and 
wherein said number of amino acid alterations is determined by multiplying the total number of amino 
acids in SEQ ID N0:2 by the integer defining the percent identity divided by 100 and then subtracting 
that produa from said total number of amino acids in SEQ ID N0:2, or: 
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wherein is the number of amino acid alterations, is the total number of amino acids in SEQ ID 
N0:2, y is 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for the multiplication 
operator, and wherein any non-integer product of Xa and y is rounded down to the nearest integer prior to 
subtracting it from x^. 

By way of example, a polypeptide sequence of the present invention may be identical to the 
reference sequence of SEQ ID N0:2, that is it may be 100% identical, or it may include up to a certain 
integer number of amino acid alterations as compared to the reference sequence such that the percent 
identity is less than 100% identity. Such alterations are selected from the group consisting of at least one 
amino acid deletion, substitution, including conservative and non-conservative substitution, or insertion, 
and wherein said alterations may occur at the amino- or carboxy-terminal positions of the reference 
polypeptide sequence or anywhere between those terminal positions, interspersed either individually 
among the amino acids in the reference sequence or in one or more contiguous groups within the reference 
sequence. The number of amino acid alterations for a given % identity is determined by multiplying the 
total number of amino acids in SEQ ID N0:2 by the integer defining the percent identity divided by 100 
and then subtracting that product from said total number of amino acids in SEQ ID N0;2, or: 

"a ^ - • y), 



wherein n^ is the number of amino acid alterations, x^ is the total number of amino acids in SEQ ID 
N0:2, y is, for instance 0.95 for 95%, 0.97 for 97% or 1 .00 for 100%, etc., and • is the symbol for the 
multiplication operator, and wherein any non-integer product of Xa and y is rounded down to the nearest 
integer prior to subtracting it from x^. 

"Individual(s)" means a multicellular eukaryote, including, but not limited to a metazoan, a mammal, 
an ovid, a bovid a simian, a primate, and a human. 

"Isolated" means altered "by the hand of man" from its natural state, Le., if it occurs in nature, it has 
been changed or removed from its original environment, or both. For example, a polynucleotide or a polypeptide 
naturaUy present in a hving organism is not "isolated," but the same polynucleotide or polypeptide separated 
from the coexisting materials of its natural state is "isolated", as the tenn is employed herein. Moreover, a 
polynucleotide or polypeptide that is introduced into an organism by transformation, genetic manipulation or by 
any other recombinant method is "isolated" even if it is still present in said organism, which organism may be 
living or non-living. 
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"Organisin(s)" means a (i) prokaiyote, including but not limited to, a member of the genus 
Streptococcus. Staphylococcus. Bordetella, Corynebacterium, Mycobacterium. Neisseria, Haemophilus. 
Actinomycetes. Streptomycetes. Nocardia. Enterobacter. Yersinia, Fancisella. Pasturella. Moraxelh. 
Acinetobacter, Erysipelolhrix, Branhamella, Actinobacillus, Streptobacillus. Listeria, Calymmatobacterium. 
Brucella. Bacillus, Clostridium. Treponema. Escherichia, Salmonella. Klebsiella, Vibrio. Proteus. Erwinia, 
Bonelia, Leptospira, Spirillum. Campylobacter, Shigella, Legionella, Pseudomonas. Aeromonas, Rickettsia, 
Chlamydia. Borrelia and Mycoplasma, and further including, but not limited to, a member of the species or 
group. Group A Streptococcus. Group B Streptococcus. Group C Streptococcus. Group D Streptococcus. 
Group G Streptococcus. Streptococcus pneumoniae. Streptococcus pyogenes. Streptococcus agalactiae. 
Streptococcus faecalis. Streptococcus faecium, Streptococcus durans. Neisseria gonorrheae. Neisseria 
meningitidis. Staphylococcus aureus. Staphylococcus epidermidis, Corynebacterium diptheriae. Gardnerella 
vaginalis. Mycobacterium tuberculosis, Kfycobacterium bovis, Mycobacterium ulcerans, Mycobacterium 
leprae. Actinomyctes israelii, Listeria monocytogenes, Bordetella pertusis. Bordatella parapertusis. 
Bordetella bronchiseptica. Escherichia coli. Shigella dysenteriae, Haemophilus influenzae. Haemophilus 
aegyptius, Haemophilus parainfluenzae. Haemophilus ducreyi, Bordetella, Salmonella typhi, Citrobacter 
freundii, Proteus mirabilis. Proteus vulgaris. Yersinia pestis, Kleibsiella pneumoniae, Serratia marcessens, 
Serratia liquefaciens. Vibrio cholera. Shigella dysenterii, Shigella flexneri, Pseudomonas aeruginosa. 
Franscisella tularensis. Brucella abortis. Bacillus anthracis, Bacillus cereus. Clostridium perfringens, 
Clostridium tetani. Clostridium botulinum. Treponema pallidum, Rickettsia rickettsii and Chlamydia 
trachomitis. (ii) an archaeon, including but not limited to Archaebacter, and (iii) a uniceUular or filamentous 
eukaryote, including but not limited to, a protozoan, a fungus, a member of the genus Saccharorro>ces. 
Kluveromyces. or Candida, and a member of the spedes Saccharomyces ceriviseae, Kluveromyces lactis. or 
Candida albicans. 

"Polynudeotide(s)" generally refers to any polyribonucleotide or polydeoxyribomicleotide, which may 
be unmodified RNA or DNA or modified RNA or DNA. "Polynucleotide{s)" include, without limitation, 
single- and double-stranded DNA, DNA that is a mixture of single- and double-sttanded regions or single-, 
double- and triple-stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- and 
double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded, or triple-stranded i^ons, or a mixture of single- and double-stranded regions. In 
addition, "polynucleotide" as used herein refers to triple-stranded regions comprising RNA or DNA or both 
RNA and DNA. The strands m such regions may be fixjm the same molecule or from dififerent molecules. The 
regions may include all of one or more of the molecules, but more typicaUy involve only a region of some of the 
molecules. One of the molecules of a triple-helical region often is an oUgonucleotide. As used herein, the tenn 
"polynucleoude(s)" also includes DNAs or RNAs as described above that comprise one or more modified 
bases. Thus, DNAs or RNAs vx-ith backbones modified fbr stabihty or for other reasons are ■•polynucleotide{s)" 
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as that teim is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or 
modified bases, such as tritylated bases, to name just two examples, are polynucleotides as the term is used 
herein. It will be appreciated that a great variety of modifications have been made to DNA and RNA that serve 
many useful purposes known to those of skill in the art. The temi "polynudeotide(s)" as it is aiq)loyed herein 
embraces such chemically, enzymatically or m^abolically modified forms of polynucleotides, as well as the 
chemical fomfis of DNA and RNA characteristic of viruses and cells, including, for example, simple and 
complex cells. "Polynucleotide(s)" also embraces short polynucleotides oftei referred to as oligonucleotide(s). 

"Polypeptide(s)" refers to any peptide or protein comprising two or more amino acids joined to each 
other by peptide bonds or modified pqjtide bonds. "Polypeptide(s)" refers to both short chains, commonly 
referred to as peptides, oligopeptides and oligomers and to longer diains generally referred to as proteins. 
Polypeptides may comprise amino acids other than the 20 goie encoded amino acids. "Polypeptide(s)" include 
those modified either by namral processes, such as processing and other post-translational modifications, but 
also by chemical modification tediniques. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature, and they are well known to those of skill in 
the art. It will be appreciated that the same type of modification may be present in the same or varying degree 
at several sites in a given polypeptide. Also, a given polypqrtide may comprise many types of modifications. 
Modifications can occur anywhere in a polypeptide, including the peptide backbone, the amino acid side^hains, 
and the amino or carboxyl termini. Modifications include, for exanq)le, acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachmait of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent 
attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, 
formation of covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, gamma- 
carboxylation, GPl anchor formation, hydroxylation, iodination, mediylation, myristoylation, oxidation, 
proteolvtic processing, phosphorylation, prenylation, raconization, giycosylation, lipid attachment, sulfetion, 
gamma-carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation, selenoylation, sulMon, 
transfer-RNA mediated addition of amino acids to proteins, such as arginylation, and ubiquitination. See, for 
instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H, 
Freeman and Company, New York (1993) and Wold, F., Posttranslational Protein Modifications: Perspectives 
and Prospects, pgs. 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS^ B. C. 
Johnson, Ed., Academic Press, New York (1983); Sdfter et al., Meth, Emymol 182:626^6 (1990) and 
Rattan et al., Protein Synthesis: Posttranslational Modifications and Aging, Ann. N.Y. Acad Sci. 663: 48-62 
(1992). Poh^peptides may be branched or cyclic with or without branching. Cyclic, branched and branched 
circular polypeptides may result fi-om post-translational natural processes and may be made by entirely 
synthetic methods, as well. 
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"Recombinant expression syston(s)" refers to expression systems or portions thereof or 
polynucleotides of the invention introduced or transfonned into a host ceD or host cell Ivsate for the 
production of the polynucleotides and polypeptides of the invention. 

"Variant(s)" as the term is used herein, is a polynucleotide or polypeptide that differs from a 
reference polynucleotide or polypeptide respectively, but retains essential properties. A typical variant of 
a polynucleotide differs in nucleotide sequence from another, reference polynucleotide. Changes in the 
nucleotide sequence of the variant may or may not alter the amino acid sequence of a polypeptide encoded 
by the reference polynucleotide. Nucleotide changes may result in amino acid substitutions, additions, 
deletions, fusion proteins and truncations in the polypeptide encoded by the reference sequence, as 
discussed below. A typical variant of a polypeptide differs in amino acid sequence from another, 
reference polypeptide. Generally, differences are hmited so that the sequences of the reference 
polypeptide and the variant are closely similar overall and, in many regions, identical. A variant and 
reference polypeptide may differ in amino acid sequence by one or more substitutions, additions, deletions 
in any combination. A substituted or inserted amino acid residue may or may not be one encoded by the 
genetic code. The present invention also includes include variants of each of the polypeptides of the invention, 
that is polypeptides that vary from the referents by conservative amino acid substitutions, whereby a residue is 
substituted by another with like characteristics. Typical sudi substitutions are among Ala, Val, Lw ; jid He; 
among Ser and Thr; among the acidic residues Asp and GIu; among Asn and Gin; and among the basic residues 
Lys and Arg; or aromatic residues Phe and Tyr. Particularly preferred are variants in vAdch several, 5-10, 1-5, 
1-3, 1-2 or 1 amino acids are substituted, deleted, or added in any combination, A variant of a polynucleotide 
or polypeptide may be a naturally occurring such as an allelic variant, or it may be a variant that is not 
known to occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides may be 
made by mutagenesis techniques, by direct synthesis, and by other recombinant methods known to skilled 
artisans. 

EXAMPLES 

The examples below are carried out using standard techniques, whidi are wefl known and routine to 
those of skill in the art, except where otherwise described in detail. The exan?)les are iUustrative, but do not 
limit the invention. 

Example 1 Strain selection, Library Production and Sequencing 

The polynucleotide having a DNA sequence given in Table 1 [SEQ ID N0:1] was obtained from 
a librar>' of clones of chromosomal DNA of Streptococcus pneumoniae in E. coli. The sequencing data 
from two or more clones comprising overlapping Streptococcus pneumoniae DNAs was used to construct 
the contiguous DNA sequence in SEQ ID N0:1. Libraries may be prepared by routine methods, for 
example: 

Methods 1 and 2 below. 



-26- 



wo 99/61452 



PCTAJS99/11704 



Total cellular DNA is isolated from Streptococcus pneumoniae 0100993 according to standard 
procedures and size-fractionated by either of two methods. 
Method 1 

Total cellular DNA is mechanically sheared by passage through a needle in order to size- 
5 fractionate according to standard procedures. DNA fragments of up to 1 Ikbp in size are rendered blunt 
by treatment with exonuclease and DNA polymerase, and EcoRI linkers added. Fragments are ligated into 
the vector Lambda ZapII that has been cut with EcoRI, the library packaged by standard procedures and 
E.coli infected with the packaged library. The library is amplified by standard procedures. 
Method 2 

10 Total cellular DNA is partially hydrolyzed with a one or a combination of restriction enzymes 

appropriate to generate a series of fragments for cloning into library vectors (e.g., Rsal, Pall, Alul, 
Bshl235I), and such fragments are size-fractionated according to standard procedures. EcoRI linkers are 
ligated to the DNA and the fragments then ligated into the vector Lambda ZapII that have been cut with 
EcoRI, the library packaged by standard procedures, and Kcoli infected with the packaged library. The 

1 5 library is amplified by standard procedures. 
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1 An isolated polypeptide selected from the group consisting of: 

(i) an isolated polypeptide comprising an amino acid having at least 95% identity 

to the amino acid sequence of SEQ ID N0:2 over the entire length of SEQ ID N0:2; 

(ii) an isolated polypeptide comprising the amino acid sequence of SEQ ID N0:2, 

(iii) an isolated polypeptide which is the amino acid sequence of SEQ ID N0:2, and 

(iv) a polypeptide which is encoded by a recombinant polynucleotide comprising the 

polyncleotide sequence of SEQ ID NO: 1 . 

2. An isolated polynucleotide selected from the group consisting of 

(i) an isolated polynucleotide comprising a polynucleotide sequence encoding a polypeptide that has at 
least 95% identity to the amino acid sequence of SEQ ID N0:2, over the entire length of SEQ ID N0:2; 

(ii) an isolated polynucleotide comprising a polynucleotide sequaice that has at least 95% identitj' 
over its entire length to a polynucleotide sequence encoding the polypeptide of SEQ ID N0:2; 
(lii) an isolated polynucleotide CQrrq)rising a nucleotide sequence which has at least 

95% identity 

to that of SEQ ID NO: 1 over the entire length of SEQ ID NO: 1 ; 

(iv) an isolated polynucleotide conprising a nucleotide sequence encoding the polypeptide of SEQ ID 

N0:2; 

(v) an isolated polynucleotide which is the polynucleotide of SEQ ID NO: 1 ; 

(vi) an isolated polynucleotide obtainable by screening an appropriate library under stringent 
hybridization conditions with a probe having the sequence of SEQ ID NO: 1 or a fragment thereof; 

(vii) an isolated polynucleotide encoding a mature polypeptide expressed by the acpS gene comprised in 
the Streptococcus pneumoniae -, and 

(viii) a polynucleotide sequence complementary to said isolated polynucleotide of (i), (li), (iii), (iv), (v), 
(vi) or (vii). 

3 A method for the treatment of an individual: 

(i) in need of enhanced activity or expression of or immunological response to the 
polypeptide of claim 1 comprising the step of: administering to the individual a therapeutically effective 
amount of an antagonist to said polypeptide; or 

(ii) having need to inhibit activity or expression of the polypeptide of claim 1 
comprising: 
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(a) administering to the individual a therapeutically effective amount of an 

antagonist to said polypeptide; or 

(b) administering to the individual a nucleic acid molecule that inhibits the 

expression of a polynucleotide sequence encoding said polypeptide; 

(c) administering to the individual a therapeutically effective amount of a 
polypeptide that competes with said polypeptide for its ligand, substrate, 

or receptor; or 

(d) administering to the individual an amount of a polypeptide that induces an 
immunological response to said polypeptide in said individual. 

4. A process for diagnosing or prognosing a disease or a susceptibility to a disease in an 
individual related to expression or activity of the polypeptide of claim 1 in an individual comprising the step 

of 

(a) determining the presence or absence of a mutation in the nucleotide sequence encoding said 
polypeptide in an organism in said individual; or 

(b) analyzing for the presence or amount of said polypeptide expression in a sample derived from 
said individual. 

5 A process for producing a polypeptide selected from the group consisting of 
(i) an isolated polypeptide comprising an amino acid sequence selected from the group having at 
least 95% identity 

to the amino acid sequence of SEQ ID N0:2 over the entire length of SEQ ID N0:2; 
(u) an isolated polypeptide comprising the amino acid sequence of SEQ ID N0:2: 

(iii) an isolated polypeptide which is the amino acid sequence of SEQ ID N0:2. and 

(iv) a polypeptide which is encoded by a recombinant polynucleotide comprising the 
polynucleotide sequence of SEQ ID NO: 1 , 

comprising the step of culturmg a host cell of claim 7 under conditions sufficient for the 
production of said polypeptide. 

6. A process for producing a host cell comprising an expression system expressing a polypeptide 
selected from the group consisting of 

(i) an isolated polypeptide comprising an amino acid sequence selected from the group having at 
least 95% identity 

to the amino acid sequence of SEQ ID N0:2 over the entire length of SEQ ID N0:2; 
(li) an isolated polypeptide comprising the amino acid sequence of SEQ ID N0:2; 
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(iii) an isolated polypeptide which is the amino acid sequence of SEQ ID N0:2, and 

(iv) a polypeptide which is encoded by a recombinant polynucleotide comprising the 
polynucleotide sequence of SEQ ID N0:1, 

said process comprising the step of transforming or transfecting a cell with an expression system 
comprising a polynucleotide capable of producing said polypeptide of (i), (ii), (iii) or (iv) when said 
expression system is present in a compatible host cell such the host cell, under appropriate culture 
conditions, produces said polypeptide of (i), (ii), (iii) or (iv). 

7. A host cell expressing a polypeptide selected from the group consisting of 

(i) an isolated polypeptide comprising an amino acid sequence selected from the group having at 
least 95% identity to the amino acid sequence of SEQ ID N0:2 over the entire length of SEQ ID N0;2; 

(ii) an isolated polypeptide comprising the amino acid sequence of SEQ ID N0:2; 

(iii) an isolated polypeptide which is the amino acid sequence of SEQ ID N0:2, and 

(iv) a polypeptide which is encoded by a recombinant polynucleotide comprising the 
polynucleotide sequence of SEQ ID NO:L 

8. An antibody immunospecific for the polypeptide of clr; n 1 . 

9 A method for screening to identify compounds which stimulate or which inhibit the function of the 
polypeptide of claim 1 which comprises a method selected from the group consisting of 

(a) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof by means of a label directly or indirectly 
associated with the candidate compound; 

(b) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof in the presence of a labeled competitor; 

(c) testing whether the candidate compound results in a signal generated by activation or 
inhibition of the polypeptide, using detection systems qjpropriate to the cells or cell membranes bearing 
the polypeptide; 

(d) mixing a candidate compound with a solution comprising a polypeptide of claim 1, to form a 
mixture, measuring aaivity of the polypeptide in the mixture, and comparing the activity of the mixture to 
a standard; or 

(e) detecting the effect of a candidate compound on the production of mRNA encoding said 
polypeptide and said poljTpeptide in cells, using for instance, an ELISA assay. 

1 0. An agonist or antagonist to the polypeptide of claims 1 to 4. 
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SEQUENCE LISTING 
<110> SmithKline Beecham Corporation 
<120> acpS 



<130> GM10154 



<150> 60/087,079 
<151> 1998-05-28 



<160> 2 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 363 
<212> DNA 

<213> Streptococcus pneximoniae 



<400> 1 

atgatagttg gacacggaat tgacatcgaa gaattggctt cgatagaaag cgcagttaca 60 
cgacatgaag gatttgctaa gcgtgtactg accgctcagg aaatggagcg cttcaccagt 120 
ctcaaaggac gcaggcaaat agaatattta gctggtcgct ggtcggctaa ggaggccttt 180 
tccaaggcta tgggaacggg cattagcaag ctcggttttc aggatttgga agtcttgaac 240 
aatgaacgtg gggcgcctta ttttagtcag gcaccatttt caggaaagat ttggctgtct 
atcagccaca ccgatcagtt tgtgacagcc agtgtcattt tggaggaaaa tcatgaaagc 
tag 



<210> 2 
<211> 120 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 2 

Met He Val Gly His Gly He Asp lie Glu Glu Leu Ala Ser He Glu 

^ S 10 15 

Ser Ala Val Thr Arg His Glu Gly Phe Ala Lys Arg Val Leu Thr Ala 

20 25 30 

Gin Glu Met Glu Arg Phe Thr Ser Leu Lys Gly Arg Arg Gin He Glu 

35 40 45 

Tyr Leu Ala Gly Arg Trp Sex Ala Lys Glu Ala Phe Ser Lys Ala Met 
50 55 60 

- 1 - 
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Giy Thr Gly lie Set Lys Leu Gly 
65 70 
Asn Glu Arg Gly Ala Pro Tyr Phe 
85 

He Trp Leu Ser He Ser His Thr 
100 

He Leu Glu Glu Asn His Glu Ser 
115 120 



Phe Gin Asp Leu Glu Val Leu Asn 

75 80 

Ser Gin Ala Pro Phe Ser Gly Lys 

90 95 

Asp Gin Phe Val Thr Ala Ser Val 
105 110 
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